Planning with Closed-Loop Macro Actions

نویسندگان

  • Doina Precup
  • Richard S. Sutton
  • Satinder Singh
چکیده

Planning and learning at multiple levels of tempo ral abstraction is a key problem for arti cial intelli gence In this paper we summarize an approach to this problem based on the mathematical framework of Markov decision processes and reinforcement learn ing Conventional model based reinforcement learning uses primitive actions that last one time step and that can be modeled independently of the learning agent These can be generalized to macro actions multi step actions speci ed by an arbitrary policy and a way of completing Macro actions generalize the classical no tion of a macro operator in that they are closed loop uncertain and of variable duration Macro actions are needed to represent common sense higher level actions such as going to lunch grasping an object or travel ing to a distant city This paper generalizes prior work on temporally abstract models Sutton and ex tends it from the prediction setting to include actions control and planning We de ne a semantics of mod els of macro actions that guarantees the validity of planning using such models This paper present new results in the theory of planning with macro actions and illustrates its potential advantages in a gridworld task

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PUMA: Planning Under Uncertainty with Macro-Actions

Planning in large, partially observable domains is challenging, especially when a long-horizon lookahead is necessary to obtain a good policy. Traditional POMDP planners that plan a different potential action for each future observation can be prohibitively expensive when planning many steps ahead. An efficient solution for planning far into the future in fully observable domains is to use temp...

متن کامل

Roles of Macro - Actions in Accelerating Reinforcement

We analyze the use of built-in policies, or macro-actions, as a form of domain knowledge that can improve the speed and scaling of reinforcement learning algorithms. Such macro-actions are often used in robotics, and macro-operators are also well-known as an aid to state-space search in AI systems. The macro-actions we consider are closed-loop policies with termination conditions. The macro-act...

متن کامل

Decentralized control of multi-robot partially observable Markov decision processes using belief space macro-actions

This work focuses on solving general multi-robot planning problems in continuous spaces with partial observability given a high-level domain description. Decentralized Partially Observable Markov Decision Processes (DecPOMDPs) are general models for multi-robot coordination problems. However, representing and solving DecPOMDPs is often intractable for large problems. This work extends the Dec-P...

متن کامل

Mathematical Formulation and Solving of Green Closed-loop Supply Chain Planning Problem with Production, Distribution and Transportation Reliability

In this paper, developed a new multi-product, multi-period, and multi-level closed-loop green supply chain planning model under uncertain conditions. The formulated model consists of five objective functions, which minimize the cost of the supply chain, minimize the CO2 emission of transportation vehicles, maximize the reliability of manufacturing and distribution centers, maximize t...

متن کامل

Sustainable closed-loop supply chain network design and operations planning considering human resource employment and training

Modeling and optimal solving of supply chain management problems lead to efficient decision making in strategic planning and supply chain operations, resulting in a competitive advantage. Today, with the planning of a sustainable supply chain, in addition to achieving economic goals, it is possible to meet social and environmental objectives and considerations. This research deal with sustainab...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997